
International Journal of Internet of Things and its Applications

Volume 1, No. 1, 2017, pp 13-28
http://dx.doi.org/10.21742/ijiota.2017.1.1.02



Increase the Performance of K-Means Clustering Algorithm Using Apache Spark



    Chang Xie
    Harbin University of Commerce, China

    Abstract

    Big data refers to data sets that are too large or complex for traditional data-processing methods. The term often refers both to the size of the data and to the data itself. Big data presents a great challenge for database and data analytics research. It is used for predictive analysis of large data sets and supports better decision making based on the available data. This paper compares Hadoop MapReduce and Apache Spark, two frameworks used for analyzing big data. Although both frameworks target big data, their performance differs from one level to another and with the implementation. We compare the performance of the two frameworks using the k-means clustering algorithm.


 
